Overview

Dataset statistics

Number of variables: 20
Number of observations: 26148
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.0 MiB
Average record size in memory: 401.5 B

Variable types

NUM: 12
CAT: 8

Reproduction

Analysis started: 2022-12-10 15:39:12.572827
Analysis finished: 2022-12-10 15:39:45.121895
Duration: 32.55 seconds
Version: pandas-profiling v2.7.1
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml
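
The reported duration can be cross-checked against the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block of the report.
started = datetime.fromisoformat("2022-12-10 15:39:12.572827")
finished = datetime.fromisoformat("2022-12-10 15:39:45.121895")

# Elapsed wall-clock time between start and finish, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```

Rounded to two decimals, the difference agrees with the "Duration" entry.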
Warnings

Year has constant value "2016" Constant
Data has a high cardinality: 118 distinct values High cardinality
Time has a high cardinality: 7675 distinct values High cardinality
TimeSunRise has a high cardinality: 51 distinct values High cardinality
TimeSunSet has a high cardinality: 56 distinct values High cardinality
SunPerDay has a high cardinality: 75 distinct values High cardinality
Month is highly correlated with Id and 1 other fields High correlation
Id is highly correlated with Month High correlation
UNIXTime is highly correlated with Month High correlation
Month is highly correlated with TimeSunRise and 1 other fields High correlation
TimeSunRise is highly correlated with Month and 1 other fields High correlation
SunPerDay is highly correlated with TimeSunSet and 2 other fields High correlation
TimeSunSet is highly correlated with SunPerDay and 1 other fields High correlation
SunPerDayHours is highly correlated with TimeSunRise and 2 other fields High correlation
Id is uniformly distributed Uniform
Id has unique values Unique
UNIXTime has unique values Unique
Speed has 375 (1.4%) zeros Zeros
Hour has 1003 (3.8%) zeros Zeros
Minute has 2158 (8.3%) zeros Zeros

Variables

Id
Real number (ℝ≥0)

HIGH CORRELATION
UNIFORM
UNIQUE
Distinct count26148
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16337.57866758452
Minimum0
Maximum32685
Zeros1
Zeros (%)< 0.1%
Memory size204.4 KiB

Quantile statistics

Minimum0
5-th percentile1608.35
Q18125.75
median16379
Q324507.25
95-th percentile31048.65
Maximum32685
Range32685
Interquartile range (IQR)16381.5

Descriptive statistics

Standard deviation9449.975676
Coefficient of variation (CV)0.5784195974
Kurtosis-1.202139245
Mean16337.57867
Median Absolute Deviation (MAD)8190.5
Skewness-0.002733428249
Sum427195007
Variance89302040.28
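
Several of the statistics reported for Id are simple functions of the others. The sketch below recomputes them from the reported values; it assumes the usual definitions (range = maximum minus minimum, IQR = Q3 minus Q1, CV = standard deviation divided by mean, variance = standard deviation squared), which the report itself does not spell out.

```python
# Values copied from the Id statistics in the report.
minimum, maximum = 0, 32685
q1, q3 = 8125.75, 24507.25
mean = 16337.57867
std = 9449.975676

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance

print(value_range, iqr, cv, variance)
```

Up to rounding of the inputs, the results agree with the Range, IQR, CV and Variance entries listed for Id.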
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
4152 1 < 0.1%
 
32190 1 < 0.1%
 
10875 1 < 0.1%
 
11572 1 < 0.1%
 
739 1 < 0.1%
 
1545 1 < 0.1%
 
9327 1 < 0.1%
 
5127 1 < 0.1%
 
24546 1 < 0.1%
 
23004 1 < 0.1%
 
Other values (26138) 26138 > 99.9%
 
ValueCountFrequency (%) 
0 1 < 0.1%
 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
ValueCountFrequency (%) 
32685 1 < 0.1%
 
32684 1 < 0.1%
 
32682 1 < 0.1%
 
32681 1 < 0.1%
 
32680 1 < 0.1%
 

UNIXTime
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE
Distinct count26148
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1478046152.704528
Minimum1472724008
Maximum1483264501
Zeros0
Zeros (%)0.0%
Memory size204.4 KiB

Quantile statistics

Minimum1472724008
5-th percentile1473229431
Q11475532770
median1478038068
Q31480478477
95-th percentile1482765838
Maximum1483264501
Range10540493
Interquartile range (IQR)4945706.5

Descriptive statistics

Standard deviation3005884.635
Coefficient of variation (CV)0.00203368794
Kurtosis-1.142572373
Mean1478046153
Median Absolute Deviation (MAD)2501998.5
Skewness0.006375119015
Sum3.86479508e+13
Variance9.035342439e+12
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1473879005 1 < 0.1%
 
1480734907 1 < 0.1%
 
1476954020 1 < 0.1%
 
1476744921 1 < 0.1%
 
1475004321 1 < 0.1%
 
1474752322 1 < 0.1%
 
1477419918 1 < 0.1%
 
1473579921 1 < 0.1%
 
1483257301 1 < 0.1%
 
1478451005 1 < 0.1%
 
Other values (26138) 26138 > 99.9%
 
ValueCountFrequency (%) 
1472724008 1 < 0.1%
 
1472724310 1 < 0.1%
 
1472725505 1 < 0.1%
 
1472725809 1 < 0.1%
 
1472726704 1 < 0.1%
 
ValueCountFrequency (%) 
1483264501 1 < 0.1%
 
1483264203 1 < 0.1%
 
1483263904 1 < 0.1%
 
1483263601 1 < 0.1%
 
1483263302 1 < 0.1%
 

Data
Categorical

HIGH CARDINALITY
Distinct count118
Unique (%)0.5%
Missing0
Missing (%)0.0%
Memory size204.4 KiB
9/23/2016 12:00:00 AM
 
245
10/27/2016 12:00:00 AM
 
243
11/18/2016 12:00:00 AM
 
242
11/3/2016 12:00:00 AM
 
241
11/19/2016 12:00:00 AM
 
240
Other values (113)
24937
ValueCountFrequency (%) 
9/23/2016 12:00:00 AM 245 0.9%
 
10/27/2016 12:00:00 AM 243 0.9%
 
11/18/2016 12:00:00 AM 242 0.9%
 
11/3/2016 12:00:00 AM 241 0.9%
 
11/19/2016 12:00:00 AM 240 0.9%
 
12/16/2016 12:00:00 AM 239 0.9%
 
12/23/2016 12:00:00 AM 239 0.9%
 
12/19/2016 12:00:00 AM 239 0.9%
 
12/2/2016 12:00:00 AM 238 0.9%
 
9/29/2016 12:00:00 AM 238 0.9%
 
Other values (108) 23744 90.8%
 

Length

Max length22
Mean length21.49395747
Min length20
ValueCountFrequency (%) 
Decimal_Number 10 66.7%
 
Other_Punctuation 2 13.3%
 
Uppercase_Letter 2 13.3%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Common 13 86.7%
 
Latin 2 13.3%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
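
The three small tables above appear to count the distinct characters used in the column, grouped by Unicode category, script and block. That reading is an inference from the totals (15 distinct characters), not something the report states. Under that assumption, the category counts can be sketched with the standard `unicodedata` module on a handful of Data values taken from the report:

```python
import unicodedata
from collections import Counter

# A few values of the Data column, copied from the report.
values = [
    "9/23/2016 12:00:00 AM",
    "10/27/2016 12:00:00 AM",
    "11/18/2016 12:00:00 AM",
    "9/14/2016 12:00:00 AM",
    "9/5/2016 12:00:00 AM",
]

# Distinct characters that occur anywhere in the values.
distinct_chars = set("".join(values))

# Number of distinct characters per Unicode general category:
# "Nd" = Decimal_Number, "Po" = Other_Punctuation,
# "Lu" = Uppercase_Letter, "Zs" = Space_Separator.
category_counts = Counter(unicodedata.category(ch) for ch in distinct_chars)
print(dict(category_counts))
```

For these five values the counts already match the table: ten digits, two punctuation marks ("/" and ":"), two uppercase letters and one space.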
 

Time
Categorical

HIGH CARDINALITY
Distinct count7675
Unique (%)29.4%
Missing0
Missing (%)0.0%
Memory size204.4 KiB
02:05:02
 
18
06:05:18
 
18
12:25:18
 
17
09:05:18
 
17
03:45:18
 
17
Other values (7670)
26061
ValueCountFrequency (%) 
02:05:02 18 0.1%
 
06:05:18 18 0.1%
 
12:25:18 17 0.1%
 
09:05:18 17 0.1%
 
03:45:18 17 0.1%
 
12:45:18 17 0.1%
 
20:05:18 17 0.1%
 
08:30:18 16 0.1%
 
20:35:18 16 0.1%
 
14:55:18 16 0.1%
 
Other values (7665) 25979 99.4%
 

Length

Max length8
Mean length8
Min length8
ValueCountFrequency (%) 
Decimal_Number 10 90.9%
 
Other_Punctuation 1 9.1%
 
ValueCountFrequency (%) 
Common 11 100.0%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 

Temperature
Real number (ℝ≥0)

Distinct count38
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean51.1048646167967
Minimum34
Maximum71
Zeros0
Zeros (%)0.0%
Memory size204.4 KiB

Quantile statistics

Minimum34
5-th percentile43
Q146
median50
Q355
95-th percentile63
Maximum71
Range37
Interquartile range (IQR)9

Descriptive statistics

Standard deviation6.213912409
Coefficient of variation (CV)0.1215914073
Kurtosis-0.3191962441
Mean51.10486462
Median Absolute Deviation (MAD)4
Skewness0.5176414002
Sum1336290
Variance38.61270743
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
45 2358 9.0%
 
48 2085 8.0%
 
47 2027 7.8%
 
46 1706 6.5%
 
50 1658 6.3%
 
49 1610 6.2%
 
51 1543 5.9%
 
52 1218 4.7%
 
53 1071 4.1%
 
54 1011 3.9%
 
Other values (28) 9861 37.7%
 
ValueCountFrequency (%) 
34 1 < 0.1%
 
35 7 < 0.1%
 
36 36 0.1%
 
37 79 0.3%
 
38 81 0.3%
 
ValueCountFrequency (%) 
71 13 < 0.1%
 
70 33 0.1%
 
69 30 0.1%
 
68 44 0.2%
 
67 69 0.3%
 

Pressure
Real number (ℝ≥0)

Distinct count38
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30.42283501606241
Minimum30.19
Maximum30.56
Zeros0
Zeros (%)0.0%
Memory size204.4 KiB

Quantile statistics

Minimum30.19
5-th percentile30.3
Q130.4
median30.43
Q330.46
95-th percentile30.49
Maximum30.56
Range0.37
Interquartile range (IQR)0.06

Descriptive statistics

Standard deviation0.05472382402
Coefficient of variation (CV)0.001798774637
Kurtosis2.119855824
Mean30.42283502
Median Absolute Deviation (MAD)0.03
Skewness-1.235212722
Sum795496.29
Variance0.002994696916
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
30.44 2652 10.1%
 
30.43 2525 9.7%
 
30.45 2449 9.4%
 
30.42 2304 8.8%
 
30.46 2164 8.3%
 
30.41 1872 7.2%
 
30.47 1837 7.0%
 
30.4 1593 6.1%
 
30.48 1299 5.0%
 
30.39 1183 4.5%
 
Other values (28) 6270 24.0%
 
ValueCountFrequency (%) 
30.19 13 < 0.1%
 
30.2 33 0.1%
 
30.21 39 0.1%
 
30.22 36 0.1%
 
30.23 65 0.2%
 
ValueCountFrequency (%) 
30.56 26 0.1%
 
30.55 23 0.1%
 
30.54 65 0.2%
 
30.53 43 0.2%
 
30.52 151 0.6%
 

Humidity
Real number (ℝ≥0)

Distinct count93
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean75.00523940645556
Minimum11
Maximum103
Zeros0
Zeros (%)0.0%
Memory size204.4 KiB

Quantile statistics

Minimum11
5-th percentile25
Q156
median85
Q397
95-th percentile102
Maximum103
Range92
Interquartile range (IQR)41

Descriptive statistics

Standard deviation25.9931019
Coefficient of variation (CV)0.3465504824
Kurtosis-0.7543674338
Mean75.00523941
Median Absolute Deviation (MAD)15
Skewness-0.7746688274
Sum1961237
Variance675.6413463
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
102 1690 6.5%
 
101 1591 6.1%
 
93 1437 5.5%
 
100 1218 4.7%
 
99 969 3.7%
 
98 757 2.9%
 
97 643 2.5%
 
96 594 2.3%
 
95 560 2.1%
 
94 483 1.8%
 
Other values (83) 16206 62.0%
 
ValueCountFrequency (%) 
11 4 < 0.1%
 
12 11 < 0.1%
 
13 36 0.1%
 
14 36 0.1%
 
15 61 0.2%
 
ValueCountFrequency (%) 
103 170 0.7%
 
102 1690 6.5%
 
101 1591 6.1%
 
100 1218 4.7%
 
99 969 3.7%
 

WindDirection(Degrees)
Real number (ℝ≥0)

Distinct count15777
Unique (%)60.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean143.25996978736424
Minimum0.09
Maximum359.95
Zeros0
Zeros (%)0.0%
Memory size204.4 KiB

Quantile statistics

Minimum0.09
5-th percentile16.937
Q181.9775
median147.645
Q3179.22
95-th percentile323.3665
Maximum359.95
Range359.86
Interquartile range (IQR)97.2425

Descriptive statistics

Standard deviation82.98817746
Coefficient of variation (CV)0.5792837844
Kurtosis0.2319634337
Mean143.2599698
Median Absolute Deviation (MAD)42.37
Skewness0.5628664262
Sum3745961.69
Variance6887.037597
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.11 76 0.3%
 
359.93 49 0.2%
 
359.92 39 0.1%
 
0.1 31 0.1%
 
0.12 29 0.1%
 
85.39 17 0.1%
 
191.13 12 < 0.1%
 
359.91 12 < 0.1%
 
25.99 12 < 0.1%
 
177.56 12 < 0.1%
 
Other values (15767) 25859 98.9%
 
ValueCountFrequency (%) 
0.09 2 < 0.1%
 
0.1 31 0.1%
 
0.11 76 0.3%
 
0.12 29 0.1%
 
0.13 8 < 0.1%
 
ValueCountFrequency (%) 
359.95 1 < 0.1%
 
359.94 3 < 0.1%
 
359.93 49 0.2%
 
359.92 39 0.1%
 
359.91 12 < 0.1%
 

Speed
Real number (ℝ≥0)

ZEROS
Distinct count36
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.248521493039621
Minimum0.0
Maximum39.37
Zeros375
Zeros (%)1.4%
Memory size204.4 KiB

Quantile statistics

Minimum0
5-th percentile1.12
Q13.37
median5.62
Q37.87
95-th percentile12.37
Maximum39.37
Range39.37
Interquartile range (IQR)4.5

Descriptive statistics

Standard deviation3.484165777
Coefficient of variation (CV)0.5575984304
Kurtosis6.910850573
Mean6.248521493
Median Absolute Deviation (MAD)2.25
Skewness1.499486136
Sum163386.34
Variance12.13941116
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
5.62 3680 14.1%
 
4.5 3612 13.8%
 
6.75 3482 13.3%
 
3.37 3093 11.8%
 
7.87 2858 10.9%
 
2.25 2201 8.4%
 
9 2035 7.8%
 
10.12 1374 5.3%
 
1.12 1102 4.2%
 
11.25 865 3.3%
 
Other values (26) 1846 7.1%
 
ValueCountFrequency (%) 
0 375 1.4%
 
1.12 1102 4.2%
 
2.25 2201 8.4%
 
3.37 3093 11.8%
 
4.5 3612 13.8%
 
ValueCountFrequency (%) 
39.37 1 < 0.1%
 
38.25 2 < 0.1%
 
37.12 2 < 0.1%
 
36 3 < 0.1%
 
34.87 3 < 0.1%
 

TimeSunRise
Categorical

HIGH CARDINALITY
HIGH CORRELATION
Distinct count51
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size204.4 KiB
06:11:00
 
1116
06:08:00
 
1065
06:10:00
 
924
06:13:00
 
921
06:12:00
 
912
Other values (46)
21210
ValueCountFrequency (%) 
06:11:00 1116 4.3%
 
06:08:00 1065 4.1%
 
06:10:00 924 3.5%
 
06:13:00 921 3.5%
 
06:12:00 912 3.5%
 
06:16:00 898 3.4%
 
06:14:00 889 3.4%
 
06:09:00 824 3.2%
 
06:56:00 690 2.6%
 
06:23:00 686 2.6%
 
Other values (41) 17223 65.9%
 

Length

Max length8
Mean length8
Min length8
ValueCountFrequency (%) 
Decimal_Number 10 90.9%
 
Other_Punctuation 1 9.1%
 
ValueCountFrequency (%) 
Common 11 100.0%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 

TimeSunSet
Categorical

HIGH CARDINALITY
HIGH CORRELATION
Distinct count56
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size204.4 KiB
17:42:00
 
2961
17:43:00
 
1829
17:44:00
 
1491
17:46:00
 
1391
17:45:00
 
1140
Other values (51)
17336
ValueCountFrequency (%) 
17:42:00 2961 11.3%
 
17:43:00 1829 7.0%
 
17:44:00 1491 5.7%
 
17:46:00 1391 5.3%
 
17:45:00 1140 4.4%
 
17:47:00 935 3.6%
 
17:51:00 925 3.5%
 
17:48:00 923 3.5%
 
17:49:00 911 3.5%
 
17:54:00 908 3.5%
 
Other values (46) 12734 48.7%
 

Length

Max length8
Mean length8
Min length8
ValueCountFrequency (%) 
Decimal_Number 10 90.9%
 
Other_Punctuation 1 9.1%
 
ValueCountFrequency (%) 
Common 11 100.0%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 

Radiation
Real number (ℝ≥0)

Distinct count11826
Unique (%)45.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean208.04478048034267
Minimum1.13
Maximum1601.26
Zeros0
Zeros (%)0.0%
Memory size204.4 KiB

Quantile statistics

Minimum1.13
5-th percentile1.19
Q11.23
median2.71
Q3358.945
95-th percentile905.965
Maximum1601.26
Range1600.13
Interquartile range (IQR)357.715

Descriptive statistics

Standard deviation316.0902472
Coefficient of variation (CV)1.519337551
Kurtosis0.4816721632
Mean208.0447805
Median Absolute Deviation (MAD)1.53
Skewness1.358960261
Sum5439954.92
Variance99913.04436
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1.22 1751 6.7%
 
1.23 1692 6.5%
 
1.21 1623 6.2%
 
1.24 1357 5.2%
 
1.2 1247 4.8%
 
1.25 949 3.6%
 
1.19 773 3.0%
 
1.26 586 2.2%
 
1.18 385 1.5%
 
1.27 367 1.4%
 
Other values (11816) 15418 59.0%
 
ValueCountFrequency (%) 
1.13 3 < 0.1%
 
1.14 4 < 0.1%
 
1.15 35 0.1%
 
1.16 68 0.3%
 
1.17 176 0.7%
 
ValueCountFrequency (%) 
1601.26 1 < 0.1%
 
1475.4 1 < 0.1%
 
1451.41 1 < 0.1%
 
1410.52 1 < 0.1%
 
1387.17 1 < 0.1%
 

Year
Categorical

CONSTANT
REJECTED
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size204.4 KiB
2016
26148
ValueCountFrequency (%) 
2016 26148 100.0%
 

Length

Max length4
Mean length4
Min length4
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

Month
Categorical

HIGH CORRELATION
Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size204.4 KiB
10
6995
11
6657
12
6527
9
5969
ValueCountFrequency (%) 
10 6995 26.8%
 
11 6657 25.5%
 
12 6527 25.0%
 
9 5969 22.8%
 

Length

Max length2
Mean length1.771722503
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

Day
Real number (ℝ≥0)

Distinct count31
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.828935291418082
Minimum1
Maximum31
Zeros0
Zeros (%)0.0%
Memory size204.4 KiB

Quantile statistics

Minimum1
5-th percentile2
Q19
median16
Q323
95-th percentile29
Maximum31
Range30
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.706177276
Coefficient of variation (CV)0.5500166066
Kurtosis-1.171341809
Mean15.82893529
Median Absolute Deviation (MAD)7
Skewness-0.06613776473
Sum413895
Variance75.79752276
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
23 947 3.6%
 
2 937 3.6%
 
22 931 3.6%
 
27 931 3.6%
 
19 929 3.6%
 
18 919 3.5%
 
14 915 3.5%
 
17 913 3.5%
 
3 913 3.5%
 
20 912 3.5%
 
Other values (21) 16901 64.6%
 
ValueCountFrequency (%) 
1 875 3.3%
 
2 937 3.6%
 
3 913 3.5%
 
4 908 3.5%
 
5 873 3.3%
 
ValueCountFrequency (%) 
31 436 1.7%
 
30 456 1.7%
 
29 886 3.4%
 
28 907 3.5%
 
27 931 3.6%
 

Hour
Real number (ℝ≥0)

ZEROS
Distinct count24
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.523022793330274
Minimum0
Maximum23
Zeros1003
Zeros (%)3.8%
Memory size204.4 KiB

Quantile statistics

Minimum0
5-th percentile1
Q16
median12
Q318
95-th percentile22
Maximum23
Range23
Interquartile range (IQR)12

Descriptive statistics

Standard deviation6.917195806
Coefficient of variation (CV)0.6002935107
Kurtosis-1.209506317
Mean11.52302279
Median Absolute Deviation (MAD)6
Skewness0.001670671384
Sum301304
Variance47.84759782
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
13 1134 4.3%
 
2 1118 4.3%
 
4 1113 4.3%
 
3 1111 4.2%
 
22 1107 4.2%
 
1 1103 4.2%
 
6 1101 4.2%
 
14 1101 4.2%
 
20 1101 4.2%
 
8 1098 4.2%
 
Other values (14) 15061 57.6%
 
ValueCountFrequency (%) 
0 1003 3.8%
 
1 1103 4.2%
 
2 1118 4.3%
 
3 1111 4.2%
 
4 1113 4.3%
 
ValueCountFrequency (%) 
23 1094 4.2%
 
22 1107 4.2%
 
21 1083 4.1%
 
20 1101 4.2%
 
19 1096 4.2%
 

Minute
Real number (ℝ≥0)

ZEROS
Distinct count26
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.522755086431086
Minimum0
Maximum57
Zeros2158
Zeros (%)8.3%
Memory size204.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q115
median30
Q341
95-th percentile55
Maximum57
Range57
Interquartile range (IQR)26

Descriptive statistics

Standard deviation17.25434808
Coefficient of variation (CV)0.626912096
Kurtosis-1.214326679
Mean27.52275509
Median Absolute Deviation (MAD)15
Skewness-0.002075381756
Sum719665
Variance297.7125277
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
35 2194 8.4%
 
50 2165 8.3%
 
40 2161 8.3%
 
25 2159 8.3%
 
0 2158 8.3%
 
10 2156 8.2%
 
30 2143 8.2%
 
15 2140 8.2%
 
55 2139 8.2%
 
5 2132 8.2%
 
Other values (16) 4601 17.6%
 
ValueCountFrequency (%) 
0 2158 8.3%
 
1 29 0.1%
 
2 1 < 0.1%
 
5 2132 8.2%
 
6 29 0.1%
 
ValueCountFrequency (%) 
57 3 < 0.1%
 
56 30 0.1%
 
55 2139 8.2%
 
51 33 0.1%
 
50 2165 8.3%
 

Second
Real number (ℝ≥0)

Distinct count60
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.44752944775891
Minimum0
Maximum59
Zeros2
Zeros (%)< 0.1%
Memory size204.4 KiB

Quantile statistics

Minimum0
5-th percentile1
Q15
median18
Q322
95-th percentile49
Maximum59
Range59
Interquartile range (IQR)17

Descriptive statistics

Standard deviation12.91335632
Coefficient of variation (CV)0.7401252057
Kurtosis0.557898701
Mean17.44752945
Median Absolute Deviation (MAD)7
Skewness0.8325560576
Sum456218
Variance166.7547715
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
18 3040 11.6%
 
2 2603 10.0%
 
20 2131 8.1%
 
17 1838 7.0%
 
21 1714 6.6%
 
19 1362 5.2%
 
4 1357 5.2%
 
1 1325 5.1%
 
22 1152 4.4%
 
5 1116 4.3%
 
Other values (50) 8510 32.5%
 
ValueCountFrequency (%) 
0 2 < 0.1%
 
1 1325 5.1%
 
2 2603 10.0%
 
3 945 3.6%
 
4 1357 5.2%
 
ValueCountFrequency (%) 
59 8 < 0.1%
 
58 16 0.1%
 
57 16 0.1%
 
56 32 0.1%
 
55 44 0.2%
 

SunPerDay
Categorical

HIGH CARDINALITY
HIGH CORRELATION
Distinct count75
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size204.4 KiB
0 days 10:56:00
 
3242
0 days 10:57:00
 
1811
0 days 11:22:00
 
472
0 days 11:12:00
 
471
0 days 11:00:00
 
471
Other values (70)
19681
ValueCountFrequency (%) 
0 days 10:56:00 3242 12.4%
 
0 days 10:57:00 1811 6.9%
 
0 days 11:22:00 472 1.8%
 
0 days 11:12:00 471 1.8%
 
0 days 11:00:00 471 1.8%
 
0 days 11:04:00 470 1.8%
 
0 days 11:37:00 468 1.8%
 
0 days 11:34:00 468 1.8%
 
0 days 11:07:00 467 1.8%
 
0 days 11:08:00 467 1.8%
 
Other values (65) 17341 66.3%
 

Length

Max length15
Mean length15
Min length15
ValueCountFrequency (%) 
Decimal_Number 10 62.5%
 
Lowercase_Letter 4 25.0%
 
Space_Separator 1 6.2%
 
Other_Punctuation 1 6.2%
 
ValueCountFrequency (%) 
Common 12 75.0%
 
Latin 4 25.0%
 
ValueCountFrequency (%) 
ASCII 16 100.0%
 

SunPerDayHours
Categorical

HIGH CORRELATION
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size204.4 KiB
11
17051
12
9097
ValueCountFrequency (%) 
11 17051 65.2%
 
12 9097 34.8%
 

Length

Max length2
Mean length2
Min length2
ValueCountFrequency (%) 
Decimal_Number 2 100.0%
 
ValueCountFrequency (%) 
Common 2 100.0%
 
ValueCountFrequency (%) 
ASCII 2 100.0%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear function the steepness of the slope does not affect r (only its sign does).

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
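
As an illustration of that definition, here is a minimal pure-Python sketch. The function name `pearson_r` and the toy data are hypothetical; the report's own computation is done by the profiling library and is not shown here.

```python
import math

def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sample covariance and standard deviations; the 1/(n-1) factors cancel in the ratio.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / (n - 1))
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / (n - 1))
    return cov / (std_x * std_y)

# Toy data: exact linear relationships with different slopes and offsets.
x = [1.0, 2.0, 3.0, 4.0]
r_up = pearson_r(x, [2 * v + 5 for v in x])       # positive slope
r_down = pearson_r(x, [-0.5 * v + 1 for v in x])  # negative slope
print(r_up, r_down)
```

The two toy cases show the invariance to location and scale: any exact linear relationship gives r of magnitude 1, with the sign following the slope.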

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
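
A minimal sketch of that recipe follows: rank both variables, then apply Pearson's formula to the ranks. The helper names are hypothetical, and giving tied values the average of their positions is an assumption (it is the common convention).

```python
import math

def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of a run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance divided by the product of standard deviations (normalisation cancels)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def spearman_rho(xs, ys):
    """Pearson's r applied to the rank variables of X and Y."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# Toy data: a monotonic but nonlinear relationship.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

On this toy data ρ reaches its maximum because the relationship is perfectly monotonic, while Pearson's r stays below 1 because it is not linear.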

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
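
The pair-counting definition can be sketched directly. The function below implements exactly the formula in the text (often called tau-a); library implementations commonly use a tie-adjusted variant instead, so treat the name `kendall_tau_a` and the handling of ties as assumptions.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1   # both variables order the pair the same way
        elif product < 0:
            discordant += 1   # the variables order the pair in opposite ways
        # Pairs tied in x or y (product == 0) count as neither.
    total_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / total_pairs

# Toy data: one swapped pair out of six possible pairs.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, giving τ = 4/6.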

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for Phik.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
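
A minimal sketch of a bias-corrected Cramér's V is given below. It assumes the correction takes the commonly cited form attributed to Bergsma (2013), shrinking φ² and the table dimensions by terms of order 1/(n-1); the function name and the toy tables are hypothetical, and the exact implementation used for the report may differ.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows.

    Assumes every row and column total is non-zero.
    """
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias corrections for phi^2 and for the effective number of rows/columns.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))

v_perfect = cramers_v_corrected([[50, 0], [0, 50]])        # perfectly associated
v_independent = cramers_v_corrected([[25, 25], [25, 25]])  # independent
print(v_perfect, v_independent)
```

The two toy tables exercise the endpoints of the range described above: perfect association and independence.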

Missing values

Sample

First rows

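(index) | Id | UNIXTime | Data | Time | Temperature | Pressure | Humidity | WindDirection(Degrees) | Speed | TimeSunRise | TimeSunSet | Radiation | Year | Month | Day | Hour | Minute | Second | SunPerDay | SunPerDayHours
0 | 4152 | 1473879005 | 9/14/2016 12:00:00 AM | 08:50:05 | 57 | 30.45 | 68 | 26.70 | 4.50 | 06:10:00 | 18:26:00 | 680.04 | 2016 | 9 | 14 | 8 | 50 | 5 | 0 days 12:16:00 | 12
1 | 13047 | 1476293121 | 10/12/2016 12:00:00 AM | 07:25:21 | 50 | 30.47 | 96 | 144.96 | 10.12 | 06:16:00 | 18:02:00 | 277.37 | 2016 | 10 | 12 | 7 | 25 | 21 | 0 days 11:46:00 | 12
2 | 7420 | 1477993220 | 10/31/2016 12:00:00 AM | 23:40:20 | 47 | 30.48 | 56 | 119.52 | 3.37 | 06:23:00 | 17:49:00 | 1.29 | 2016 | 10 | 31 | 23 | 40 | 20 | 0 days 11:26:00 | 11
3 | 6508 | 1473013505 | 9/4/2016 12:00:00 AM | 08:25:05 | 57 | 30.47 | 93 | 38.61 | 2.25 | 06:08:00 | 18:35:00 | 544.75 | 2016 | 9 | 4 | 8 | 25 | 5 | 0 days 12:27:00 | 12
4 | 29110 | 1481885434 | 12/16/2016 12:00:00 AM | 00:50:34 | 41 | 30.23 | 103 | 177.55 | 2.25 | 06:50:00 | 17:46:00 | 1.22 | 2016 | 12 | 16 | 0 | 50 | 34 | 0 days 10:56:00 | 11
5 | 25348 | 1483015543 | 12/29/2016 12:00:00 AM | 02:45:43 | 37 | 30.35 | 54 | 177.00 | 6.75 | 06:56:00 | 17:53:00 | 1.15 | 2016 | 12 | 29 | 2 | 45 | 43 | 0 days 10:57:00 | 11
6 | 13599 | 1476124823 | 10/10/2016 12:00:00 AM | 08:40:23 | 60 | 30.44 | 47 | 4.88 | 1.12 | 06:16:00 | 18:03:00 | 578.60 | 2016 | 10 | 10 | 8 | 40 | 23 | 0 days 11:47:00 | 12
7 | 20795 | 1479113702 | 11/13/2016 12:00:00 AM | 22:55:02 | 45 | 30.47 | 83 | 114.39 | 6.75 | 06:30:00 | 17:44:00 | 1.20 | 2016 | 11 | 13 | 22 | 55 | 2 | 0 days 11:14:00 | 11
8 | 28143 | 1482176417 | 12/19/2016 12:00:00 AM | 09:40:17 | 53 | 30.53 | 84 | 154.09 | 7.87 | 06:52:00 | 17:48:00 | 269.53 | 2016 | 12 | 19 | 9 | 40 | 17 | 0 days 10:56:00 | 11
9 | 15824 | 1475442321 | 10/2/2016 12:00:00 AM | 11:05:21 | 57 | 30.47 | 99 | 40.61 | 1.12 | 06:14:00 | 18:10:00 | 942.25 | 2016 | 10 | 2 | 11 | 5 | 21 | 0 days 11:56:00 | 12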

Last rows

(index) | Id | UNIXTime | Data | Time | Temperature | Pressure | Humidity | WindDirection(Degrees) | Speed | TimeSunRise | TimeSunSet | Radiation | Year | Month | Day | Hour | Minute | Second | SunPerDay | SunPerDayHours
26138 | 6265 | 1473089111 | 9/5/2016 12:00:00 AM | 05:25:11 | 47 | 30.40 | 73 | 179.01 | 7.87 | 06:08:00 | 18:35:00 | 2.69 | 2016 | 9 | 5 | 5 | 25 | 11 | 0 days 12:27:00 | 12
26139 | 22118 | 1478716805 | 11/9/2016 12:00:00 AM | 08:40:05 | 54 | 30.46 | 51 | 337.08 | 2.25 | 06:28:00 | 17:45:00 | 484.33 | 2016 | 11 | 9 | 8 | 40 | 5 | 0 days 11:17:00 | 11
26140 | 11284 | 1476831318 | 10/18/2016 12:00:00 AM | 12:55:18 | 58 | 30.41 | 98 | 350.19 | 9.00 | 06:18:00 | 17:57:00 | 556.69 | 2016 | 10 | 18 | 12 | 55 | 18 | 0 days 11:39:00 | 11
26141 | 11964 | 1476625223 | 10/16/2016 12:00:00 AM | 03:40:23 | 50 | 30.49 | 24 | 158.70 | 5.62 | 06:18:00 | 17:59:00 | 1.19 | 2016 | 10 | 16 | 3 | 40 | 23 | 0 days 11:41:00 | 11
26142 | 21575 | 1478879702 | 11/11/2016 12:00:00 AM | 05:55:02 | 46 | 30.47 | 36 | 180.23 | 6.75 | 06:29:00 | 17:44:00 | 1.23 | 2016 | 11 | 11 | 5 | 55 | 2 | 0 days 11:15:00 | 11
26143 | 29802 | 1481676309 | 12/13/2016 12:00:00 AM | 14:45:09 | 50 | 30.28 | 96 | 304.22 | 12.37 | 06:48:00 | 17:45:00 | 216.29 | 2016 | 12 | 13 | 14 | 45 | 9 | 0 days 10:57:00 | 11
26144 | 5390 | 1473426025 | 9/9/2016 12:00:00 AM | 03:00:25 | 44 | 30.37 | 100 | 162.80 | 3.37 | 06:09:00 | 18:31:00 | 1.47 | 2016 | 9 | 9 | 3 | 0 | 25 | 0 days 12:22:00 | 12
26145 | 860 | 1474966519 | 9/26/2016 12:00:00 AM | 22:55:19 | 48 | 30.42 | 64 | 158.90 | 4.50 | 06:12:00 | 18:15:00 | 1.20 | 2016 | 9 | 26 | 22 | 55 | 19 | 0 days 12:03:00 | 12
26146 | 15795 | 1475451021 | 10/2/2016 12:00:00 AM | 13:30:21 | 56 | 30.42 | 99 | 55.72 | 13.50 | 06:14:00 | 18:10:00 | 659.12 | 2016 | 10 | 2 | 13 | 30 | 21 | 0 days 11:56:00 | 12
26147 | 23654 | 1478255721 | 11/4/2016 12:00:00 AM | 00:35:21 | 45 | 30.45 | 56 | 136.11 | 5.62 | 06:25:00 | 17:47:00 | 1.18 | 2016 | 11 | 4 | 0 | 35 | 21 | 0 days 11:22:00 | 11
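
The Month, Day, Hour, Minute, Second and SunPerDay columns look like they were derived from Data, Time, TimeSunRise and TimeSunSet. The report does not say how they were produced, so the following is only a sketch of one plausible derivation, checked against the sample row with Id 4152. The variable names are illustrative.

```python
from datetime import datetime, timezone

# Values from the sample row with Id 4152.
data, time_str = "9/14/2016 12:00:00 AM", "08:50:05"
sunrise, sunset = "06:10:00", "18:26:00"
unix_time = 1473879005

# Calendar parts from the Data column, clock parts from the Time column.
date = datetime.strptime(data, "%m/%d/%Y %I:%M:%S %p")
clock = datetime.strptime(time_str, "%H:%M:%S")
month, day = date.month, date.day
hour, minute, second = clock.hour, clock.minute, clock.second

# Day length as the difference between sunset and sunrise.
sun_per_day = datetime.strptime(sunset, "%H:%M:%S") - datetime.strptime(sunrise, "%H:%M:%S")

# UNIXTime interpreted as seconds since the epoch, in UTC.
utc = datetime.fromtimestamp(unix_time, tz=timezone.utc)

print(month, day, hour, minute, second, sun_per_day, utc.isoformat())
```

For this row the derived parts and the 12:16:00 day length match the table. The UTC clock time comes out ten hours ahead of the Time column, which would be consistent with local timestamps recorded at UTC-10; that offset is an inference from this one row, not something the report states.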